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1 XML access control: Access control of XML documents considering update 
<^ o perations 

^ Chung-Hwan Lim, Seog Park, Sang H. Son 

October 2003 Proceedings of the 2003 ACM workshop on XML security 

Publisher: ACM Press 

Full text available: 'Q pdf(298.78 KB) Additional Information: full citation , abstract , references , index terms 

As a large quantity of information is presented in XML format on the Web, there are 
increasing demands for XML security. Until now, research on XML security has been 
focused on the security of data communication using digital signatures or encryption 
technologies. As XML is also used for a data representation of data storage, XML security 
comes to involve not only communication security but also managerial security: 
Managerial security is guaranteed through access control, but existing XML acces ... 



Keywords: XML document, XML update, access control 



2 On XML integrity constraints in the presence of DTDs I 
A. Wenfei Fan, Leonid Libkin 

^ May 2002 Journal of the ACM (JACM), volume 49 issue 3 
Publisher: ACM Press 

^ „ . ^ u, « ^^/ono C7 un^ Additional Information: full citation , abstract , references , citings, index 
Full text available: g pdf(398.67 KB) ^^^^^ 

The article investigates XML document specifications with DTDs and integrity constraints, 
such as keys and foreign keys. We study the consistency problem of checking whether a 
given specification is meaningful: that is, whether there exists an XML document that both 
conforms to the DTD and satisfies the constraints. We show that DTDs interact with 
constraints in a highly intricate way and as a result, the consistency problem in general is 
undecidable. When it comes to unary keys and foreign keys ... 

Keywords: Consistency, DTDs, XML, implication, integrity constraints 



^ Ty ping and querying XML docunnents: some complexity bounds 

#Luc Segoufin 
June 2003 Proceedings of the twenty-second ACM SXGMOD-SZGACT-SIGART 
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symposium on Principles of database systems 

Publisher: ACM Press 

CT II* ^ I ui 0 i^Dx Additional Information: full citation , abstract , references , citin gs, index 

Full text available: TO pdf(246.23 KB) ^ ^ 

*^ terms 

We study the complexity bound of validating XML docunnents, viewed as labeled unranked 
ordered trees, against various typing systems like DTDs, XML schemas, tree automata ... 
We also consider query evaluation complexities for various fragments of XPath. For both 
problems, validation and query evaluation, we consider data and combined complexity 
bounds. 

DTD inference for views of XML data 
Yannis Papakonstantinou, Victor Vianu 

May 2000 Proceedings of the nineteenth ACi^ SIGMOD-SIGACT-SIGART symposium 

on Principles of database systems 
Publisher: ACM Press 

.- .1 X ^ M u. 01 ^€i^Ai i^Dx Additional Information: full citation , abstract , references , citings, index 
Full text available: TO pdf(347.61 KB) ; 

terms 

We study the inference of Data Type Definitions (DTDs) for views of XML data, using an 
abstraction that focuses on document content structure. The views are defined by a query 
language that produces a list of documents selected from one or more input sources. The 
selection conditions involve vertical and horizontal navigation, thus querying explicitly the 
order present in Input documents. We point several strong limitations in the descriptive 
ability of current DTDs and the need for extendi ... 

Data integration and sharing I: Exchanging intensional XML data 

Tova Milo, Serge Abiteboul, Bernd Amann, Omar Benjelloun, Fred Dang Ngoc 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Additional Infomiation: full citation, abstract , references , citing s, index 



Full text available: • — - 

terms 

XML is becoming the universal format for data exchange between applications. Recently, 
the emergence of Web services as standard means of publishing and accessing data on 
the Web introduced a new class of XML documents, which we call intensional documents. 
These are XML documents where some of the data is given explicitly while other parts are 
defined only intensionally by means of embedded calls to Web services. When such 
documents are exchanged between applications, one has the choice to ... 

Re-en g ineering structures frona Web documents 
Chuang-Hue Moh, Ee-Peng Lim, Wee-Keong Ng 

June 2000 Proceedings of the fifth ACM conference on Digital libraries 
Publisher: ACM Press 

^ -. ui 0 ^^/-lort nc LTDx Additional Information: full citation , abstract, references , citings, index 
Full text available: ^ pdfd 80.95 KB) 

To realize a wide range of applications (including digital libraries) on the Web, a more 
structured way of accessing the Web is required and such requirement can be facilitated 
by the use of XML standard. In this paper, we propose a general framework for reverse 
engineering (or re-engineering) the underlying structures i.e., the DTD from a collection of 
similarly structured XML documents when they share some common but unknown DTDs. 
The essential data structures and algorithms for ... 

Keywords: Web information discovery, XML 
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7 Views in a lar g e-scale XML repository 

Vincent Aguilera, Sophie Cluet, Tova Mile, Pierangelo Veltri, Dan Vodislav 

November 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, volume 11 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ pclf(241.60 KB) Additional Information: full citation , abstract , citings , index terms 

We are interested In defining and querying views In a huge and highly heterogeneous XML 
repository (Web scale). In this context, view definitions are very large, involving lots of 
sources, and there is no apparent limitation to their size. This raises interesting problems 
that we address in the paper: (i) how to distribute views over several machines without 
having a negative impact on the query translation process; (ii) how to quicl<ly select the 
relevant part of a view given a query; (iii) how ... 

Keywords: Query evaluation. Semantic integration, Views, Warehouse, XI^L 



8 Session 4: Web service applications: Authenticating distributed data using Web 
services and XML signatures 
Daniel J. Polivy, Roberto Tamassia 

November 2002 Proceedings of the 2002 ACM workshop on XML security 
Publisher: ACM Press 

Full text available: ^p df ( 164.09 KB) Additional Information: full citation , abstract, references, index terms 

As the need for digital data becomes more ubiquitous, so does the need to provide 
efficient mechanisms for distributing and verifying the authenticity of that data. We 
present an architecture for authenticating responses to queries from untrusted mirrors of 
authenticated dictionaries using Web Services and XML Signatures. We also describe an 
implementation of our scheme for the Secure Transaction Management System. 

Keywords: Web services, XML, authentication, digital signatures 




9 Advanced XML technologies and applications: Consistently updating XML documents 
using incremental constraint check queries 
Bintou Kane, Hong Su, Elke A. Rundensteiner 

November 2002 Proceedings of the 4th international workshop on Web information 
and data management 

Publisher: ACM Press 

Full text available: ^ pdf(399.07 KB) Additional Information: full citation, abstract , references , index terms 

When updating a valid XML document, an efficient yet light-weight mechanism is needed 
to determine if the up-date would invalidate the document. Towards this goal, we 
developed a framework called SAXE, we first analyzed the constraints expressed in XML 
schema specifications and establish constraint rules that must be observed for an XML 
document to conform to a given XML Schema. We then classify the rules as relevant for a 
given update case, that is, we show the minimal set of rules that must be ... 




Keywords: XML schema, XML update, XQuery 



XDuce: A staticall y ty ped XML processinc language HI 
Haruo Hosoya, Benjamin C. Pierce 

May 2003 ACM Transactions on Internet Technology (TOIT), volume 3 issue 2 
Publisher: ACM Press 

Full text available: ^ p df (242.48 KB) Additional Information: full citation , abstract , references , citings , index 
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terms , review 

XDuce is a statically typed programming language for XML processing. Its basic data 
values are XML documents, and its types (so-calied regular expression types) directly 
correspond to document schemas. XDuce also provides a flexible form of regular 
expression pattern matching, integrating conditional branching, tag checking, and subtree 
extraction, as well as dynamic typechecking. We survey the principles of XDuce's design, 
develop examples illustrating its key features, describe i ... 

Keywords: Type systems, XML, subtyping, tree automata 



11 Research sessions 2 and 3: information processing on WWW and XML: Validatin g \ 
^ streaming XIVIL documents 

^ Luc Segoufin, Victor Vianu 

June 2002 Proceedings of the twenty-first ACM SIGI^OD-SIGACT-SIGART symposium 

on Principles of database systems 
Publisher: ACIVI Press 

r- ^ •. ui 01 ^r/n^oo^LTDv Additional Information: fuH citation , abstract , references , citings , index 

Full text available: pdf(243.31 KB) ^ 

*^ terms 

This paper investigates the on-line validation of streaming XML documents with respect to 
a DTD, under memory constraints. We first consider validation using constant memory, 
formalized by a finite-state automaton (FSA). We examine two flavors of the problem, 
depending on whether or not the XML document is assumed to be well-formed. The main 
results of the paper provide conditions on the DTDs under which validation of either flavor 
can be done using an FSA. For DTDs that cannot ... 

12 Technical correspondence: Efficient formalism-only parsing of XML/HTML using the 

^ §-calculus 

^ Quinn Tyler Jackson 

February 2003 ACM SIGPLAN Notices, volume 38 issue 2 

Publisher: ACM Press 

Full text available: Ipl pdf(2.57 MB) Additional Information: full citation , abstract , references 



Traditionally, correct parsing of XML and HTML has been littered witli semantic hacl<s in 
the parsing code to deal with the oddities of these languages, since HTML accepts 
unbalanced tags and tags that do not match In case, but XML is less forgiving. The 
detection of well-formedness of XML documents has, to date, required semantic analysis 
outside of the grammar specification. We present a grammar-only (HT\X)ML parser which, 
upon detecting that it is parsing XML, modifies itself dynamically In ord ... 

Extending Java for hi g h-level Web service construction 
Asl<e Simon Christensen, Anders M0ller, Michael 1. Schwartzbach 
November 2003 ACM Transactions on Programming Languages and Systems 

(TOPLAS), Volume 25 Issue 6 
Publisher: ACM Press 

,- ^ •. u. A jr/r^^-rno Additional Information: full citation , abstract , references , citings , index 

Full text available: S pdf(947.02 KB) ^ 

terms 

We incorporate innovations from the <bigwig> project into the Java language to provide 
high-level features for Web service programming. The resulting language, JWIG, contains 
an advanced session model and a flexible mechanism for dynamic construction of XML 
documents, in particular XHTML. To support program development we provide a suite of 
program analyses that at compile time verify for a given program that no runtime errors 
can occur while building documents or receiving form input, and ... 

Keywords: Interactive Web services, XML, data-flow analysis 
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14 Data inte g ration and sharing I: Capturing both types and constraints in data | 
^ integration 

^ Michael Benedikt, Chee-Yong Chan, Wenfei Fan, Juliana Freire, Rajeev Rastogi 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 

Management of data 
Publislier: ACM Press 

.. * ^ -I ui 0t Atfann co i/d\ Additional Information: full citation , abstract , references , citings, index 
Full text available: ^ pdf(690.62 KB) terms 

We propose a framework for integrating data from multiple relational sources into an XML 
document that both conforms to a given DTD and satisfies predefined XML constraints. 
The framework is based on a specification language, AIG, that extends a DTD by (1) 
associating element types with semantic attributes (inherited and synthesized, inspired by 
the corresponding notions from Attribute Grammars), (2) computing these attributes via 
parameterized SQL queries over multiple data sources, and (3) inc ... 

15 Session 1: encryption: A stream-based implennentation of XML encryption 

Tal<eshi Imamura, Andy Clark, Hiroshi Maruyama 

November 2002 Proceedings of the 2002 ACM workshop on XML security 
Publisher: ACM Press 

Full text available: gpdf(93.90 KB) Additional Information: full citation , abstract , references, index terms 

W3C has been working on the standardization of XML Encryption and released its 
specification as a W3C Proposed Recommendation in 2002. There are several 
implementations of the specification, all of which are implemented using DOM. However, 
it is commonly accepted that DOM has higher costs in time and space than other APIs. 
Also, even if SAX is used, with this kind of API, it is impossible to parse decrypted data 
both efficiently and correctly. Therefore, we thought of using the Xerces Native Int ... 

Keywords: XML, encryption, stream-based processing 



CDuce: an XML-centric g eneral-purpose lan guage 
Veronique Benzaken, Giuseppe Castagna, Alain Frisch 

August 2003 ACM SIGPLAN Notices , Proceedings of the eighth ACM SIGPLAN 

international conference on Functional programming ZCFP '03, volume 38 

Issue 9 

Publisher: ACM Press 

^ MX ^ u. 0 -ir/o^o i^Dx Additional Information: full citation , abstract, references , cltjnfls. index 
Full text available: ^ pdf(242.16 KB) ^^^^ 

We present the functional language CDuce, discuss some design issues, and sfiow its 
adequacy for worl<ing with XML documents. Distinctive features of CDuce are a powerful 
pattern matching, first class functions, overloaded functions, a very rich type system 
(arrows, sequences, pairs, records, intersections, unions, differences), precise type 
inference for patterns and error localization, and a natural interpretation of types as sets 
of values. We also outline some important implementation issue ... 

Keywords: CDuce, XML, XML-processing, type systems 



17 Re g ular expression t y pes for XML 
^ Haruo Hosoya, Jerome Vouillon, Benjamin C. Pierce 

^ September 2000 ACM SIGPLAN Notices , Proceedings of the fifth ACM SIGPLAN 

international conference on Functional programming ICFP '00, volume 
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35 Issue 9 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 



Full text available: ' , 

^ terms 

We propose regular expression types as a foundation for XML processing languages. 
Regular expression types are a natural generalization of Document Type Definitions 
(DTDs), describing structures in XML documents using regular expression operators (i.e., 
*, ?, I, etc.) and supporting a simple but powerful notion of subty ping. The decision 
problem for the subtype relation is EXPTIME-hard, but it can be checked quite efficiently 
In many cases of practical interest. The subtyping algori ... 

18 Representin g and queryin g XML with incomplete information 
^ Serge Abiteboul, Luc Segoufin, Victor Vianu 

NT May 2001 Proceedings of the twentieth ACM SIGMOD-SZGACT-SIGART symposium 
on Principles of database systems 

Publisher: ACM Press 

Full text available: « pdf(22627KB) Additional Information: full citation , abstract, references , ciflngs. index 

We study the representation and querying of XML with incomplete information. We 
consider a simple model for XML data and their DTDs, a very simple query language, and 
a representation system for incomplete information In the spirit of the representations 
systems developed by Imielinski and Lipski for relational databases. In the scenario we 
consider, the incomplete information about an XML document Is continuously enriched by 
successive queries to the document. We show that our representation ... 

19 Document querying and transformation: Lazy XSL transformations 
^ Steffen Schott, Markus L. Noga 

^ November 2003 Proceedings of the 2003 ACM symposium on Document engineering 
Publisher: ACM Press 

Full text available: ^ p df (335.83 KB) Additional Information: full citation , abstract , references , index terms 

We introduce a lazy XSLT interpreter that provides random access to the transformation 
result. This allows efficient pipelining of transformation sequences. Nodes of the result 
tree are computed only upon initial access. As these computations have limited fan-in, 
sparse output coverage propagates backwards through the pipeline. In comparative 
measurements with traditional eager Implementations, our approach is on par for 
complete coverage and excels as coverage becomes sparser. In contrast to eag ... 

20 XML with data values: typecheckin g revisited 
Noga Alon, Tova Milo, Frank Neven, Dan Suciu, Victor Vianu 

May 2001 Proceedings of the twentieth ACi^ SIGMOD-SIGACT-SIGART symposium 
on Principles of database systems 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citing s, index 



Full text available: .^. r^^.,-, 

terms 

We investigate the type checking problem for XML queries: statically verifying that every 
answer to a query conforms to a given output DTD, for inputs satisfying a given Input 
DTD. This problem had been studied by a subset of the authors In a simplified framework 
that captured the structure of XML documents but ignored data values. We revisit here 
the type checking problem in the more realistic case when data values are present in 
documents and tested by queries. In this extended frame ... 
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